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CONVEXIFYING POSITIVE POLYNOMIALS 
AND SUMS OF SQUARES APPROXIMATION 

KRZYSZTOF KURDYKA, STANISLAW SPODZIEJA 


Abstract. We show that if a polynomial / G R[a:i,... ,Xn] is nonnegative 
on a closed basic semialgebraic set X = {x £ : gi{x) > 0,.. . ,gr{x) > 0}, 

where gi,... ,gr- G Rfxi,... then / can be approximated uniformly on 

compact sets by polynomials of the form (tq + ip(gi)gi + ■ ■ • + g>{gr)gr, where 
CTO G R[xi,..., Xn] and g} G R[i] are sums of squares of polynomials. In 
particular, if X is compact, and h(x) := R? — \x\^ is positive on X, then 
/ = CTO + CTih + ip{gi)gi + ■ ■ ■ + ip(gr)gr for some sums of squares ctq, cti G 
R[xi,..., Xn] and G R[t], where \x\^ = xf + ■ ■ ■ + x'^. We apply a quantita¬ 
tive version of those results to semidefinite optimization methods. Let X be 
a convex closed semialgebraic subset of R" and let / be a polynomial which 
is positive on X. We give necessary and sufficient conditions for the existence 
of an exponent TV G N such that (1 + \x\^)^ f{x) is a convex function on X. 
We apply this result to searching for lower critical points of polynomials on 
convex compact semialgebraic sets. 


Introduction 

In the paper we study two types of problems for polynomials which are positive 
(or nonnegative) on subsets of R". In the first part we prove stronger versions of 
known approximation and representation theorems with sums of squares of poly¬ 
nomials. Next we give quantitative versions of these results and explain some 
applications to semidefinite optimization methods. In the second part we prove 
that any polynomial / which is positive on a convex closed set X becomes strongly 
convex when multiplied by (1 -f |a:p)^ with N large enough (the noncompact case 
requires some extra assumptions). In fact we give an explicit estimate for N, which 
depends on the size of the coefficients of / and on the lower bound of / on A. As 
an application of our convexification method we propose an algorithm which for a 
given polynomial / on a compact semialgebraic set X produces a sequence (starting 
from an arbitrary point in X) which converges to a critical point of / on A. We 
also relate convexity and positivity issues. 

O.I. Notation and state of the art. We denote by R[j:] or ..., Xn] the ring 
of polynomials in a: = (xi ,... ,Xn) with coefficients in R. Important problems of 
real algebraic geometry are representations of nonnegative polynomials on closed 
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semialgebraic sets. Recall Hilbert’s 17th problem (solved by E. Artin [2]): if / S 
K[a;] is nonnegative on R", then 

(AH) fh^ = h1-\ - + for some h,hi,... ,hm S ]R[x], h ^ 0, 

that is, / is a sum of squares of rational functions. With the additional assumptions 
that / is homogeneous and f{x) > 0 for a; 0, B. Reznick |241 Theorem 3.12] 
proved that there exists an integer rg such that for any N > rg the polynomial 
{x1 + • • • + x^)^ f{x) is a sum of even powers of linear functions. 

Let X C R" be a closed basic semialgebraic set defined hy gi,^ gr G R[a;], i.e., 

(0.1) W = {a; G R" : gi{x) > 0,... ,3r(a;) > 0}. 

The preordering generated by gi,..., g^ is defined to be 

Tigi,...,gr) = [ ••• 5 ®'- : cTe G ^R[a;]^ for e G {0,1}’'! , 

where ^R[a;]^ denotes the set of sums of squares (s.o.s.) of polynomials from 
R[a;]. Natural generalizations of the above theorem of Artin are the Stellensatze of 
J.-L. Krivine [12], D. W. Dubois |9|, and J.-J. Risler [26] (see also (3). For refer¬ 
ences and a more detailed discussion of this subject see for instance [28], |2^, [22| . 
When the set X is compact, a very important result was obtained by K. Schmiidgen 
(see m, 0): every strictly positive polynomial f on X belongs to the preorder¬ 
ing T{gi,..., gr). M. Schweighofer [30| studied degree bounds in the Schmiidgen 
Positivstellensatz representation 

/= ^ cTegt^ ■■■gr^ GT{gi,...,gr). 

e€{0,iy 

He obtained an upper bound for deg Ceg^^ ■ ■ ■ gp' in terms of deg /, /* := min{/(a;) : 
X G X} and the coefficients of /, provided that /* > 0. As shown by C. Scheiderer 
m, there is no such bound in terms of deg/ unless dim(A) < 1. Under some 
additional assumptions M. Putinar |23| proved that / belongs to the quadratic 
module generated by gi,... ,gr, 

P{gi, ...,gr) ■= jcTo -I- CTigi H-1- arPr ■ o'* G ^R[a;]^, i = 0,... ,r|. 

The above results concern strictly positive polynomials. In the case of nonnegative 
polynomials C. Berg, J. P. R. Christensen and P. Ressel |4] and J. B. Lasserre and 
T. Netzer m Corollary 3.3] proved that any polynomial / which is nonnegative 
on [—1,1]” can be approximated in the Zi-norm by sums of squares of polynomials. 
The Zi-norm of a polynomial is defined to be the sum of the absolute values of its 
coefficients (in the usual monomial basis). Hence we have 

Fact 1. If a polynomial f G R[a;] is nonnegative on ]—R, i?]", i? > 0, then the 
polynomial f{Rx) can be approximated in the li-norm by sums of squares of poly¬ 
nomials. In particular f{x) can he uniformly approximated on ]—R, R]” by sums of 
squares of polynomials. 

D. Hilbert m proved that for n > 2 there are nonnegative polynomials on 
R" which are not sums of squares of polynomials. T. S. Motzkin [21] gave an 
explicit example of such a polynomial, f{xi,X 2 ) = 1-1- -I-— 3), i.e., in 

the representation (lAHIl of / the degree of h must be positive. So in general the 
Schmiidgen Positivstellensatz does not hold on noncompact sets. For a polynomial 
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/ positive on a noncompact set X the problem arises of approximation of / by 
elements of the preordering T{gi ,..., gr) or of the quadratic module P{gi, ■ ■ ■, gr)- 
In this connection J. B. Lasserre [m Theorem 2.6] (see also [T^) proved that if 
gi,... ,gr are concave polynomials such that gi (z) > 0 ,... gr{z) > 0 for some 2 £ X, 
then any convex polynomial nonnegative on X can be approximated in the /i-norm 
by polynomials from the set 

Lcigi, ...^gr) ■■= |cro + \\gi H-h \lgr ■ cto G ^K[a;]^ convex, Ai,..., Ar- G k|. 

For X = K.” the approximation is uniform on compact sets. J. B. Lasserre [T 6 ] 
proved that if a polynomial / G M[a;] has a global minimum /* > 0 then for every 

2fc 

e > 0 there is G N such that the polynomial := / + £ X]fc=i Sj=i is a sum 
of squares (see also [18] for polynomials on real algebraic sets). 

0.2. Our contributions. In this article, we prove an analogue of the Schmiidgen 
and Putinar theorems for a smaller cone. Namely for 5 G M[a;] we put 

■ ,5r) := |cto + aig + (p{gi)gi H-h ^p{gr)gr ■ ^ 0,^1 S 

where t is a single variable. Note that if we set 

d>(gi,... , 5 ^) := \g}{gi)gi +-h <p(gr-)ffr : € ^]R[t] 2 |, 

then 

K.{g,gi,...,gr) = T{g) + <^{gi,... ,gr), 

where A^B = {a + h : a & A, b € B}. In Section [1] we prove lTheorem ll.ll) that for 
a closed basic semialgebraic set X defined by gi,... , 3 ^ G R[a;] and a polynomial 
/ G R[a;] the following conditions are equivalent: 

(i) / is nonnegative on X, 

(ii) / can be uniformly approximated on compact sets by polynomials from the 
cone 

S{gi, ...,g^):= + $(gi,... , 5 ^). 

Moreover, / can be approximated by polynomials from 5 (^ 1 ,..., gr) in the 
li-norm. 

In particular, if X is a compact set and g{x) := E? — |xp > 0 for x G X, then (see 
Corollary 1 2. 1(1 

(0.2) / is strictly positive on X =)> f G X{g, gi,..., gr)- 

0.3. Application to optimization. In [T5| Lasserre gave a method of minimizing 
a polynomial / on a compact basic semialgebraic set X of the form (iniD. More 
precisely, let 

r := inf{/(x) : X G X}. 

Then f* = sup{a G R : /(x) — a > 0 for x G X}, and by Putinar’s result |23j . 
f* = sup{a G R : / - a G P{gi ,..., gr)}, 

or equivalently 

f* = inf{L(/) : L : R[x] —>• R is linear, L(l) = 1, L{P{gi ,..., gr)) C [0, 00 )}. 
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Denote 

Pkigi ,... ,gr) := jcTogo H-h (Jrgr e Pigi, ■■■,gr) ■ degcngi < fc, i = 0,... ,r|, 

where we set go = 1- Lasserre considered the following optimization problems: 

maximize a G R : / — a G Pk{gi, ■ ■ ■ ,gr), 

minimize L{f) for L : R[x]fc —>■ R, linear, L(l) = 1, L{Pk{gi,... ,gr)) C [0, oo), 
where R[x]fc is the linear space of polynomials h £ R[x] such that degh < k. Set 

Ofc := sup{a £ R : / - a G Pfe(gi,.. ■,gr)}, 

II := inf{X(/) : L : R[x]fe R is linear, L(l) = 1, L{Pk{gi,.. .,gr)) C [0, oo)}, 

for sufficiently large k G N. Lasserre proved that (a^), {ID are increasing sequences 
that converge to /* and a^. < l^. < f* for fc £ N. 

We obtain a version of the Lasserre theorem for 

K^k{g,gi, ■.■,gr) ■■= {co + + (p{gi)gi H-h ifiigDgr e /C(g, gi,... ,gr) : 

degao,degaig,degg^ip{gi) < k}, fc £ N. 

The implication (10.21) allows us to apply the Lasserre algorithm of minimizing poly¬ 
nomials on basic compact semialgebraic sets by using /Cfe(g,gi,... ,gr) instead of 
Pfc(g, gi,..., gD (see Remark l2.2|) . Consideration of the cones /Cfe(g, gi,..., gD po¬ 
tentially simplifies the problem of minimizing polynomials on the set X, since these 
cones are properly contained in Pk{g,gi ,... ,gr)- 

In Proposition [531 ^6 present another method of minimizing a polynomial / on 
a compact basic semialgebraic set X, say X G {x G R" : |x| < R}. Namely, for any 
e > 0, we give an effective procedure for calculating a polynomial h G ‘h(gi, ■ ■ • ,gr) 
such that 

V|y|<R f{y) - h{y) > f{x) - h{x) - e, 
and |h(x)| < e for x G X. In particular, 

r-2e< inf{/(y) - h{y) : |g| < R} < f* + 2e. 

Thus, the problem of approximate minimization of / can be reduced to the simpler 
case when the set X is described by one inequality — |xp > 0 (see Remark 12.511 . 
In this case M. Schweighofer [3^ gave the rate of convergence of the sequence 

al* := sup{a gR:/ — h — a£ Pk{R^ — |gP)} —t /**, as fc —>• oo, 
where f** := inf{/(g) - h{y) : \y\ < R}. 

0.4. Convexifying positive polynomials. We will prove Theorem ih. 51 which, we 
believe, is of independent interest: for any polynomial / positive on a convex closed 
set X, whose leading form is strictly positive in R" \ {0}, there exists Nq G N such 
that for any integer N > Nq the polynomial ipn{x) = (1 -I- |xp)^/(x) is a strictly 
convex function on X. In the case of homogeneous polynomials and X = R" the 
same result was obtained by Reznick [251 Theorem 4.6], [24l Theorem 3.12]. 

First in Section [3] we consider the univariate case, and we give an explicit bound 
for Nq in terms of the coefficients of / and the infimum /*. We also give an example 
to show that Nq cannot be a function of the degree of / alone. 

In Section|5]we prove that the convexity at infinity of (Pn{x) = (1 -I- |xp)^/(x) 
for sufficiently large N is equivalent to the strict positivity of the leading form 
of / IProDOsition 15.31) . Moreover, in Corollary 15.81 we obtain an interpretation of 
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Reznick’s result |241 Theorem 3.12] in terms of convexity. As a consequence of 
Theorem 15 . 51 we prove in Corollary 15.71 that, if X is a convex set containing at least 
two points, and d > deg / is an even integer, then the following conditions are 
equivalent: 

(i) / is nonnegative on X, 

(iii) for any a, 6 > 0 there exists Nq G N such that for any integer N > Nq 
the polynomial (Pn{x) = (1 + \x\'^)^{f {x) + a|x|'^ + 6) is a strictly convex 
function on X. 

Finally, we propose the following algorithm. Given a compact convex semial- 
gebraic set X and a polynomial / : M" —>■ R, assume that / is positive on X. 
Then by our convexification result, there exists an integer N such that ip]\[^^(x) := 
(1 + It — ^P)^/(t) is a convex function for any ^ G X. (Actually one can take 
N = 6.) Choose any ag G X, and then by induction set := argminx <PN,a„-i- 
In Theorem 16.51 we state that the limit a* = limi,_^oo Ui/ exists; moreover, a* is a 
critical point of / on AT. The proof requires subtle estimates for the lengths of 
gradient trajectories of / on Ai. Since the set of critical values is finite, this result 
gives a method for finding the minimum of / on A". 

1. Approximation of nonnegative polynomials 

Let X C R" be a closed basic semialgebraic set defined by gi,gr G R[t], i.e. 
of the form (10.1|) . 

Theorem 1.1. Let f G R[t] be nonnegative on the set X. Then there exists a 
sequence G P{gi, ■ ■ ■ ,gr), iz £ N, that is uniformly convergent to f on compact 
subsets. Moreover, fy can be chosen from the cone S(gi,..., gr) 0 . In particular fi, 
converges to f in the li-norm. 

Proof. Take any positive constants £,6,A,B. By the Weierstrass Approximation 
Theorem there exists a polynomial ipe.s.A.B £ R[^] such that 

(1.1) (ps,s,A,B{t) > B for t £ [-A,-(5], 

(1.2) Te,S,A,B{t) <£ for t £ [0, A]. 

Taking ^ a b necessary, we may additionally assume that 

(1.3) Pe,5,A.B{t) >0 for t £ R. 

Set 

9i.£,5.A^B ■— 9i * Pe.S.A,B ^ 9i fol" ^ — 1, . . . , T. 

Every nonnegative univariate polynomial is a sum of squares of polynomials, hence 
by (1131) we have 

(1.4) Pe,S,A,B o gi for i = l,...,r. 

Since the sequence /ij, = / + i, p £ N, uniformly converges to /, we may assume 
that / is positive on X. 

Fix an arbitrary i? > 1 and let M > 1 be a constant such that 

(1.5) f{x) > —M for X G [— 


^Recall that S(ffi,..., gr) = {rro + ¥’( 31)31 H-1 </’(3r)3r- : cq G gy e 
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Since / is positive on X, we have 

xn[-R,Rr c Gi, 

where the set Gi := {x € [—i?, i?]" : f{x) > 0} is open in [—i?, i?]". As An[—i?, i?]” 
is a compact set, there exists ry > 0 such that 

G 2 := {x G [—i?, i?]" : dist(x, X) < 77 } C Gi. 

Since [—i?,-R]"\G 2 = {x € [—R,R]^ : dist(x,X) > 77 } is also compact, by the 
definition of X there exists i5 G (0,1] such that 

(1.6) G 3 := {x G [—i?, i?]" : gi{x) > —d for i = 1,..., r} C G 2 . 

Let 

/fi := min{/(a;) : x G G 2 }. 

Obviously /^ > 0. 

Let A > 1 be a constant such that 


Take 

(1.7) 


\9i{^)\ ^ ^ for X € [—R: z = 1 ,..., r. 


e 


fn 


(r + 1 )A’ 


B — A 


M + re 

6 


Lemma 1.2. For any x G [—R, i?]" we have f{x) — 9i,e,s,A,B{x) > 0. 


Proof. Take x G [—R, i?]". 

If a; G A, then gi{x) > 0 for i = 1,..., r, and by 

f* 

gi,e,s,A,B{x) = gi{x) ■ ipe,s,A,B o igi{x)) < As < ^ for 7 = 1,..., r. 

r 

So 

fix) - ^gi,e,5,A,B{x) > fn- r^ > 0 , 
i=l 

and the assertion holds. 

Let X G G^\X. Without loss of generality we may assume that 
gi[x),...,g},{x)>Q and gk+i{x),..., gr{x) < Q 
for some 0 < fc < r. Then by (II.2L 

fnp. 

gi,e,s,A,B{x) < Ae < — for z = 1,..., fe, 
r 

and by (11.31) . 

gi,e,S,A,B{.x) <0 for Z = fc + 1, . . . ,7’. 

Consequently, f{x) — 9i,e,s,A,B(x) > ffi — > 0, and the assertion holds. 

Let now x G [—i?, i?]" \ G 3 . Without loss of generality we may assume that 

9i{x), ■ ■ ■, 9 k{x) >0, 0 > 9k+i{x), ■. ■,9i{x) > S, 9i+i{x), ■ ■ ■, 9 r{x) < -S, 

where 0 < k < I < r. Then 

f* 

9i,e.S,A,Bix) < -^ for Z = 1, . . . , fc, 

r + 1 

and 

9i,e,5,A,Bix) < 0 for Z = fc + 1, . . . , L 
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By mi we see that 

9i,e,s,A,B{x) < A{-M - re) < -M - for i = / + 1,..., r. 

r + 1 

Summing up, 

fix) -'^gi,e,s,A,B{x) > -M - + {r-l) (m + > 0, 

^ r + 1 \ r + ly 

as desired. □ 


Remark 1.3. The polynomial (fie,s,A,sit) in the above proof can be chosen of the 
form 



with 7Vlog(l — < loge, iVlog(^)^ < log£ and 7Vlog(l + > logR. 

M. Schweighofer EH Lemma 2.3] in a similar problem proposes a polynomial ip of 
the form (f{t) = as{at — 1)'^^ for some s G N and a > 0. 

By Lemma [T21 for any i? > 0 there exists (fji G such that 

r 

fix) - X] ‘^Ri9iix))gi{x) >0 for X G [-i?, i?]”. 

By Fact [T] in the Introduction, it is easy to see that f{x) — 9 ^R{ 9 i{x))gi{x) 

can be approximated in the Zi-norm by sums of squares of polynomials and it 
can be approximated uniformly on [—by sums of squares of polynomials. 
Consequently, / can be approximated uniformly on [—(in particular in the 
Zi-norm) by polynomials from the cone 5 ( 51 ,..., Hence we deduce the assertion 
of Theorem 11.11 □ 


2. Quantitative aspects of Theorem 11.11 

In order to estimate the rate of convergence in Lasserre’s relaxation method m 
we show how to bound the degree of the polynomial ip in Theorem 11.11 The key 
point is to find a lower bound for S which satisfies the inclusion (11.61) . 

Assume now that A is a compact set of the form 

(2.1) A = {x G : gi{x) > 0,... , 3 ,(x) > 0}, 

where gi,... ,gr G K[x]. Choose i? > 0 large enough so that 170 ( 2 ;) = — |xp is 

nonnegative polynomial on A. We now define a cone 

r 

K.{go,...,gr) ■■= +(Tigo + '^ip{,gi)gi : cro,cri G y]K[x]^, ip G 

i=l 

By the argument in the proof of Theorem 11.11 we obtain 
Corollary 2.1. If f G M[x] is strictly positive on the set X, then f G IC(go,... ,gr)- 
Proof. By Lemma lOl there exists ip G such that 

r 

h{x) = /(x) - p{gi{x))g,{x) > 0 

i=l 
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for \x\ < R. Since {x € K” : |x| < i?} = {x S R" : g[){x) > 0}, Putinar’s Posi- 
tivstellensatz (or Schmiidgen’s Positivstellensatz, because P{go) = T{go)) yields 

/i e |cro + CTiffo : o-o,cri € ^R[x]^| = }C{go), 
which completes the proof. □ 

Corollary 12.11 also follows from Schweighofer’s result |31l Lemma 2.3] and the 
Putinar theorem. 


Remark 2.2. We may use the Lasserre algorithm for minimization of a polyno¬ 
mial / on a compact basic semialgebraic set X by using ]C{g, gi,..., gr) instead 
of P{g, gi,..., gr). In fact, we can use the set }Ck{g, gi, - ■ ■, gr) consisting of all 
CTo + o-ig + YJi=i ^{9i)gi S IC{g,gi, ...,gr) such that degdo < k, deg dig < k and 
degip{gi)gi < k for i = 1,... ,r. Consider the following optimization problems: 

• maximize a S M such that f — a € }Ck{g,gi, ■ ■ ■ ,gr), 

• minimize L{f) for L : M[x]fc —>■ R, linear, L{1) = 1, L{lCk{g,gi, ■ ■ ■ ^gr)) C 
[0,oo). 

Denote 

ul := sup{a e R : / - a G ICkig,gi, ■ ■ ■,gr)}, 

vl := inf{L(/) : L : R[x]fc -)> R is linear, L(l) = 1, L{ICk{g,gi, ■ ■ ■ ,ffr)) C [0, oo)}, 

for sufficiently large fc G N. We see that (u^), (u^) are increasing sequences that 
converge to /* (by Corollarv l2.1l) and < v^. < f* for /c G N. □ 


2.1. Quantitative Lojasiewicz inequality. Let gi,...,gr G R[x], and let G : 

R" —>• R be defined by 

(2.2) G(x) =max{0,-gi(x),...,-gr(a:)}, x G R”. 

Then X = {x G R" : gi(x) > 0,... ,gr{x) > 0} = G“^(0). Moreover, 


where Yq = X x {0}, 
Y, = {(x,y) G H 


graph G = Fo U Yi U • • • U W, 


X R : y = -gi{x), gi{x) < 0, gi{x) < gj{x) for j ^ i}. 


for i = 1,..., r. Note that each set F^, i = 0,..., r, is defined by r inequalities 
and one equation. Let d = maxjdegyi,... ,deg 3 i.}. We now state the well-known 
Lojasiewicz inequality in a quantitative version proved in m Corollary 2.3] (see 
also m Corollary 10]): there exist G, £ > 0 such that 

..... ........ 

(2-3) G(x) > C (TrrFpr 

with 


X G 


(2.4) £ < d{6d - 

It follows from (|2.3I) that for every p > 0 there exists Gp > 0 such that 

(2.5) G(x) > Gp dist(x,X)^ for any x G i?(p), 

where B{p) = {x G R" : |x| < p}. Fix R > 0 such that X C B{R). Assume that 
(12.51) holds with fixed C' = Cr and £. 
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Fact 2 . Let 77 > 0. Set Sq = Crj^. Then for any 0 < S < 60 , 

{x G B{R) : gi{x) > —S for i = 1,..., r} C {x G B{R) : dist(x, X) < 77 }. 

Indeed, take x € B{R) \ X such that gi{x) > —S for 7 = 0,..., r. Let G be the 
function defined by (12.21) . Hence by (12.5L 

6 > max{—gi(x),. .., —gr(x)} = G(x) > C'dist(x,X)^. 

Thus for 0 < 5 < i5o we deduce the assertion of Fact m □ 


2 . 2 . Approximation. For i/ = (z/i,..., p„) G N” we set \v\ = vi + ■ ■ ■ + i^n and 
a'^ = ■ ■ ■ a'ff, where a = (ai,..., a„) G K". For h G M[x] of the form 

d 

h(x) = X! X! 

J=0 |y|=j 

we define 

d d 

A{h,R) M{h,R) ='^Y1 for ^ > 0- 

i=o H=j i=i H=j 


Then for x G B{R) we have \h{x)\ < A{h,R) and by the Euler formula for 
homogeneous functions, |V/i(x)| < M{h,R). 

Using a similar argument to the one for Theorem ll.il we obtain the following 


Proposition 2.3. Let f G ]R[x], let X be a semialgebraic set of the form jniD sueh 
that X C B[R), i? > 0, and let gi,...,gr G ]R[x] be polynomials satisfying (12.51) 
with fixed C, C > 0. Take AI, A G R such that 

M > max{l,A(/,i?),B(/, i?)}, A > max{l, A(( 7 i, i?)} for i = 1,... ,r. 


Take e > 0, and set 


where 


, . , 1 5 


2N 


r. e . f. e .T {(r-l)A-l A{2M^l-5) 2rA-e\ 


Then the function 


Kx) = ^‘p{gi{x))g,{x) G ^{gi,...,gr) 

i=l 

satisfies the following conditions: 


(2.6) 0 < h{x) < e for x G X, 

(2.7) ^|y|<R fiy) - Kv) > fix) - h{x) - e. 


Proof. It is easy to see that for the function 
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where 0 < S < A, N > ^ , N > have 

(2.8) < 2]\f^ for t e [0, A], 

(2.9) ^(f) < -2M - for t < -5. 

From the assumptions of M and A we have |/(a;)| < M, |V/(a;)| < M and 
\gi{x)\ < A for i = 1,...,r and x € M" such that |a;| < R. 

Take any e > 0. Let 

y := {y e K" : |y| < i? A 3,ex f{y) > f{x) - e/2}, 

Yi := {y € R" : |y| < i? A dist(y,X) < yj. 

By the Mean Value Theorem, Yi C Y. From Fact [21 for 0 < ^ < Cr]^ we have 
Y 2 := {x £ M" : |a;| < i? A gi{x) > —S for i = 0,..., r} C Yi C Y 

Obviously h(x) > 0 for a; £ V. Since h(x) = (/>(gi(x)) and gi(x) £ [0,A] 

for x £ V, by (12.81) and the assumption N > obtain (12.6F 

Now we prove (1^ . Obviously it holds for y £ V. 

Take y G Y 2 \X. Without loss of generality we may assume that 

gi{y),---,gk{y)>0 and gk+i{y),...,gr{y) <0 
for some 0 < k < r. Then there exists x G X such that /(y) > f{x) — |. So, (12.81) 
and the assumption N > Si^e 

k 

f{y)-Ky) > fix)-^-h{y) > /(x)-|-^(/( 5 *(y)) > f{x)-€ > f{x)-h{x)-e. 

i=l 

This proves (12.71) for y £ I 2 \ 

Let now y G {x G K" : |x| < i?, x ^ 12 }- Without loss of generality we may 
assume that 

gi{y),---,gk{y) > o, O > gk+i{y),...,gi{y) > -S, gi+i{y),...,gr{y) < -S, 

where 0 < k < I < r. Then, by the choice of M, the assumption N > 
and (12.9p we see that h{y) < —2M, and so for any x £ X we have 

/(y) - Hy) > -M + 2M > /(x) > /(x) - h{x) > /(x) - h{x) - e. 

This gives (12.7p in the case under consideration and ends the proof. □ 

Remark 2.4. If we assume that yi,..., are y-strongly concave polynomials, i.e., 
giiy) < gi{x) + (y - x, Vyi(x)) - ^|y - x|^ for x,y £ R”, 

where y > 0 and (•, •) is the standard scalar product, then the assertion of Fact|2] 
holds with (5o = y^/r/2. Hence, Proposition 12.31 holds with 0 < d < min {H, 

Remark 2.5. We can use Proposition l2.3l to minimize a polynomial / on a compact 
basic semialgebraic set X. Let X C {x £ R” : |x| < i?}. Then for any e > 0, we can 
effectively compute a polynomial h{x) = Ei=i‘/^(yi(^))ffj(^)! where tp £ ^R[t]^, 
such that 


r-2e< inf{/(y) - h{y) : |y| < R} < f* + 2e. 
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To approximate /*, we can minimize f — h on B{R). To this end we may compute 
al* := sup{a gR:/— h — aG Pk{R^ — |yP)} for fc G N. 

By the Putinar Theorem (or the Schmiidgen Theorem) we see that 

al* f** as fe ^ oo, 


where f** := ini{f{y) - h{y) : \y\ < R}. 

Minimization oi f — h on B{R) is much simpler than minimizing f on X, be¬ 
cause the set B{R) is described by one inequality — |a;p > 0. In this case 
M. Schweighofer [30] gave the rate of convergence of the sequence al*: 


r - al* < 


</k 


for some constant c G N depending on / and R^ — |yp and some constant d G N 
depending on R^ — |j/p. 


3. Convex polynomials in one variable 


We denote by N* the set of strictly positive integers. In this section x denotes a 
single variable. Let / G M[a;] be a nonzero polynomial. For any iV G N* we define 
the following polynomial: 

(3.1) ipN^x) := {1 + f{x). 


We will find Nq G N* such that for N > Nq the polynomial is strongly convex 
on a closed interval / C R, provided / is positive on I. 

For positive numbers m,R,D we set 


(3.2) Af{m, R, D) := max 


D 

m 


m {1 + R^)D ^ (WKR^I 
16D’ Rm ’ 2m J 


We first prove that if / is a function positive on a bounded interval I, then 
(Pn{x) = (1 -I- x^)^ f(x) is convex for every N sufficiently large. We formulate this 
lemma for functions because restricting to polynomials does not simplify the 
proof considerably. 


Lemma 3.1. Let f be a function positive on an interval I = [a, 6] C R, and let 
R > max{|a|, |6|}. If m, D > 0 satisfy the conditions 

(3.3) m < min{/(a::) : x G I}, 


(3.4) \nx)\<D, \nx)\<D for\x\<R, 

then for any N G N satisfying 

(3.5) N > Afim,R,D) 

we have :p'ff{x) > 0 for x G I, thus ip^ix) is strongly convex on I. 
Proof. Denote Pn = An -t Bn + Qn + T/v, where 

An{x) = AN{N - l)x'^f{x), Bn{x) = 2N{1 + x'^)f{x), 

Qn{x) = 4iV(l -t x'^)xf'{x), Tn{x) = (1 -t x'^)^f"{x). 


Then 

(3.6) 


= (1 + x'^)^ ‘^Pn{x). 
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Let N G N satisfy ()3.5I) . To prove that is convex on I we will proceed in 
several steps. From (j3.3l) and (j3.4|) we obtain 


(3.7) 

An{x) > 4TV(TV — l)x^TO 

for X G /, 

(3.8) 

Bn{x) > 2TV(1 + x^)to 

for X G /, 

(3.9) 

Qn{x) > —4TV(1 + x^) X H 

for X < R, 

(3.10) 

Tn{x) > -{l + x^D 

for X < R. 


Since N satisfy (1531) . we have 
(3.11) 

Note that then 


D m 
TV > — + 


m IQD 


(3.12) 


m 

AD 


< 


Nm — D 

D ■ 


Assume now that x £ I, \x\ < ^. Then obviously AAr(x) > 0. By (13.811 and (13.91) 
we have 

-Bn{x) + Qn{x) > 0. 

Also by (j3.8|) . (j3.10l) and (|3.12p . 

^Bn{x) +Tn{x) > 0. 

So for TV satisfying (13.111) we have P^ix) > 0, and consequently by (13.6L 


(3.13) 


(p'^{x) >0 for X G /, |x| < 


AD' 


We have to show now that Pn{x) > 0 for x G ^ < fy| < R. By (13.51) we have 

((^ j- / 

(3.14) TV > max ■ 


(1 + R^)D AD"^ 

+ 1 ,—-+2 


Rm 




By (|3.7D and (|3.9D we see that 

(3.15) An{x) + QNix) > (—Il|xp + (TV — l)m|x| — D)AN\x\ for x G /, |x| < R, 
and by (13.141) . 

-Dl- 


/ TO m 


\ADJ 


+ (TV-1)to- D>0 

AD 


and 


-DR^ + {N - l)mR -D>0. 


Hence —D\x\'^ + (TV — 1)to|x| — H > 0 for ^ < |x| < i?, and (I3.15|) gives 


An{x) + Qn{x) >0 for X G /, < |x| < R. 


(3.16) 

By (|3.5D we have 

N > 

then, by p.8l) and p.lOl) . we obtain 

(3.17) 


(1 + Afy£> 

2 to ’ 


TO 


Bm{x) + Ti\j{x) > 0 for X G /, < |x| < R. 

Consequently, by (13.161) . (13.171) and (|3.6D . we have 
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Summing up, for N satisfying (13.5|) . by (13.131) and (j3.18l) . we have ^p%{x) > 0, 
X S /, which means that tpN is strongly convex on / and Lemma l3.II is proved. □ 


Remark 3.2. Lemma [3.11 was proved under the assumption that the function / 
is C^. If we assume that / is a polynomial which is positive except possibly at 
0 € K, then an analogous argument leads to a strictly convex function ipf^. More 
precisely, let / S M[x] be a polynomial positive on / = [a, b] except possibly at 
0 € R, where 0 S (a, b). Then there exists Nq £ N such that for any TV € N with 
TV > TVg the polynomial is strictly convex on I. 

For a polynomial of degree d of the form 


d 

(3.19) f = '^aiX^~\ 

i^O 

and i? > 0, we set 


Q-q, . . . , dd ^ ^0 0: 


D{f, R) := max 


d-l 

2=0 


i=0 


We easily see that for any D > D{f,R) the assumption (13.41) of Lemma [3.11 holds. 
If cT > 0, we define 


K(f) = 1 + 2 max 

l<i<d 



Obviously K{f) > 0. It is known that if f{z) = 0, z € C then \z\ < K{f). Since for 
d > 2 the complex zeroes of /' and /" lie in the convex hull of the set of complex 
zeroes of /, 


(3.20) /, /' and /" have no zeroes x G R such that |x| > K{f). 

We prove a version of Lemma 13.11 for a polynomial on an arbitrary interval. 
(A version of this lemma, without explicit bound for TV, has been proven in the 
M.Sc. thesis of 1. Fau [T0].l 


Lemma 3.3. Let f G R[x] be positive on a closed interval / C R. Let m > 0 satisfy 
(13.31) . and let R > K{f) and D > D{f,R) {or let D satisfy (I3.4|) l. Then for any 
integer TV > M{m, R, D) the polynomial ipi\j{x) is strongly convex on L. 


Proof. By the same argument as for (I3.20|) . we deduce that +jv(x) for x < —R has 
the same sign as if'ff{—R). Analogously, (p'fi{R) and (p'ff{x) for x > R have the 
same sign. Moreover, ip'ff{—R) 0 and (p'ff{R) 0. So considering the sign of 
on the intervals Ji = / D [—R, T?], J 2 = / IH [T?, + 00 ) and J-^ = I r\ (— 00 , —R] we 
deduce the assertion by Lemma [3.11 Note that the strong convexity oi is due 
to the fact that f is a polynomial. □ 


Remark 3.4. Under the assumptions of Lemma IXTl and with the same argument, 
we obtain the assertion of this lemma for the function (ppf^^{x) = (1 + (x —^)^)^/(x) 
instead of (pN, where ^ G [—R, R], with the bound TV > Af{m, 2R, D). Hence, the as¬ 
sertion of Lemma l3.3l holds for the function +Ar.{ with the bound TV > Af{m, 2R, D). 

The exponent TV in Lemma 13.31 actually depends on the coefficients of / even 
when the degree of / is fixed. 
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Example 3.5. Let fk{x) = {x — k)^ + 1. Obviously fk is a convex function. We 
have ipn{x) = ((x — + 1)(1 + and LpN{k) = (1 + k"^)^, v^(0) = + 1, 

(/5Ar(|) = + 1)^+^. Assume that is convex. Then 

/ p \N+1 ^ ^ 

(y+i) <^{k^+i)+^{k^+ir. 

So the number N (in Lemma 13.31) such that the function is convex tends to 
infinity as k ^ oo. 

Remark 3.6. By a similar argument to that for Lemmas l3.1l and [3.3l one can prove 
(see HD]): for any / G R[a;] positive on K and any g G R[x] such that g{x) > 0 and 
g"{x) > 0 for X G R there exists Nq G N such that for any N > Nq the polynomial 
fg^ is strictly convex on R. 


4. CONVEXIFYING POLYNOMIALS ON COMPACT SETS 


Let X = (xi,..., x„) be a system of variables and let / G R[x] be a polynomial 
of the form 


(4.1) 

For R> 0 define D(/, R) ■= 




1=0 H=j 


max|l,\/l + R2^ j\a^\R:) \ (1 + jU - l)|a^|R^ 

1=1 kl=l 1=2 kl=l 

This will be a bound for the first and the second derivatives in (|4.5I) below. 


Theorem 4.1. Let / G R[x] be positive on a compact convex set X C R" containing 
at least two points. Set R = max{|x| : x € X}, and let 

(4.2) 0 < m < min{/(x) : x G X}. 

Then for any D > ID)(/, R) and any integer N > Af {m, i?, D) the polynomial 
(Pn{x) = (1 + xf + • • • + xf)^f{x) is strongly convex in X. 


Proof. Let 

A = {(a, /3) G R" X R" : {a, /3) = 0, |/3| = 1}, 

and let 

(4.3) "fa, pit) := Vl + |ap/3t + a. 

Clearly the family of all with (a,/3) G A parametrizes all affine lines in R". 
Denote by C A the set of all {a, fi) G A for which the line parametrized by ja.p 
intersects X. It is easy to see that S is a compact set and 

(4.4) S C {(a,/3) G ^ : |a| < i?}. 

It suffices to prove that for any {a, fi) G B and N > Af{m, R, D) the function fo^^^p 
is strictly convex on la^p = {t G R : ja.pit) G X}. Since A is a compact convex 
set, Ia,p is a compact interval or a point. 

It is obvious that for (a,/3) G B the set {t G R : \^a.pit)\ < i?} is an in¬ 
terval centered at 0 (or a point), say [—i?c(,/ 3 , Ra,^]- Moreover, we have la^p G 
[—Ra,P,Ra,p] C [— i?, i?]. 
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If / is of the form (j4.1L then we easily see that for t G K such that | 7 a,^(t)| < R 
we have 


and 


i(/o7o./3)'wi < E 


|(/0 7a./l)"WI < (1 + ^^)E E 

i=2 k|=i 

SO 

(4.5) \{f O la,p)'{t)\ < D, |(/0 7 „_^)"(t)| < D for t G 

A simple computation gives 

(4.6) l + |7„./5(0P = (l + l«P)(l+i^), 


hence 

O 7a,/3(<) = (1 + |an^(l + f O 


Obviously ipNO"fa,i 3 is a strongly convex function on la^p if and only if the function 
Ia,p 9 t H. (1 + f o 7 a,/ 3 (t) is Strongly convex. Now applying Lemma [3.11 we 
deduce the assertion. □ 


Remark 4.2. Under the assumptions of Theorem l4.ll and with the same argument, 
we obtain the assertion of this theorem for the function ipN^^{x) = (l + |x—^p)^/(x) 
instead of (fN, where ^ G K", with the bound N > Af{m, 2R, D). 


5. Convexity at infinity 


We briefly recall basic definitions. For a function / in an open subset of R 
Hxf stands for the Hessian matrix of / at x. The associated quadratic form 
R” — 7 . R reads 


(5.1) 


hxfiy) = {H^f{y),y). 


Recall that the matrix H^f is said to be positive semidefinite (respectively positive 
definite) if hx{y) > 0 for any y G R" (respectively hxf{y) > 0 for 1 / ^ 0). Set, for 
A C {!,...,n}, 45^0, 


A;g := det 


52 / 


dxidxi 


iJ&E 


Recall a classical fact (Sylvester criterion): 


Lemma 5.1. H^f is positive semidefinite (respectively positive definite) if and only 
if Ag > 0 (respectively A^ > OJ for all nonempty E C {1,..., n}. 

Let / G M[a;] and n > 2. We call / locally convex (respectively locally strictly 
convex or locally strongly convex) in an open set G C R" if any point x G G has 
a convex neighbourhood U C R” such that the restriction /|(7 is convex (respec¬ 
tively strictly convex or strongly convex). In particular / is locally convex in G 
if and only if H^f is positive for any x G G. We say that / is convex at infinity 
(respectively strictly convex at infinity or strongly convex at infinity) if there exists 
i? > 0 such that / is locally convex (respectively locally strictly convex or locally 
strongly convex) in G = {a; G R" : > ^}- analogous terminology will be 

used for concave functions. 
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Let d = deg / > 0 and let fo,... ,fd S K[a;] be homogeneous polynomials such 
that /j = 0 or deg fi = i, and / = /q + • • • + /d. Since d = deg/, we have fd ^ 0. 

Lemma 5.2. If f is convex at infinity, then fd is a convex function. 

Proof. Assume that fd is not convex. Then for some nonempty E G {1,... ,n} and 
xo 0, A^^{xq) < 0. Since A;^ is nonzero it must be a homogeneous polynomial 
of degree k{d — 2), where k is the number of elements of E. Then 

A{,{txo) = t^^'^-^^A{fixo) + E{t) 

with some polynomial F{t) of degree less than k{d— 2). So A^^{txo) < 0 as t —>■ oo, 
hence / is not convex at infinity, which contradicts the assumption. □ 

To obtain the convexity of (p^ we will assume that fd{x) > 0 for a; G R" \ {0}. 
This assumption is natural, as the following proposition shows. 

Proposition 5.3. The following conditions are equivalent: 

(a) fd{x) > 0 /or a; G R” \ { 0 }, 

(b) there exist R > 0 and Nq G N such that for any integer N > Nq the 
polynomial 

ipN{x) = (1 + a;^ + • • • + xl)^f{x) 

is locally strongly convex on G = {a; G R” : |a;| > i?}, 

(c) there exists Nq G N such that for any integer N > TVq the polynomial 
is convex at infinity. 

Proof, (a)^(b). We use the notations (14.31) of the proof of Theorem 14.11 namely 
A = {(a,/3) G R” X R" : |/3| = 1, (a, fi) = 0} and -fa,p{t) := ^/TTWfit + a- We 
shall use a convenient renormalization of / o ja.p. For (a, /?) G A we set 

(5.2) 9 a,pit) := (^1 + |q;P)"‘*/ o 7a,/3(t). 

The next crucial lemma gives an estimate on the size of the coefficients of 

d 

foja,pit) = 

Lemma 5.4, There exists a constant C > 0 such that for any (a, /3) G A, 

(5.3) \ciia,fi)\ < G(\/l + fori = 0,...,d. 

Proof. It is enough to check the assertion for a monomial ax^^ ■ ■ ■ x)fi with ki + 

• • • “h kfi d, EH 

Write ga,pit) = (\/l + \a\'^)~'^foja,pit) = Lemma EH yields 

a uniform estimate for the coefficients: 

\ai{a,fi)\<C, i = 0,...,d. 

By the assumption that fdix) > 0 for a: 7 ^ 0 it follows that 
aoia,fi) = fdifi) > inf fd{x) = e > 0 , 

|a:| = l 
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so for if = 1 + 2 C/e we have if > 1 + 2 maxi=i_...^d I ao(a’J) Take 

R> K and let 

d-l d-2 

ii >max{l, i)(d-z - l)i?‘^-*- 2 |^ 

2—0 z —0 

Then g'^ ^{t) < D and 5 " p{t) < ii for i £ [—i?, ii]. Again by the assumption that 
fd{x) > 0 for X 0, one can assume that there exists m > 0 such that for |x| > R 
we have f{x) > + \x\^Y. So 

ga,pit) > + | 7 a,/ 3 (t)p) > m for |7a,/3(0l > R- 

To end the proof of the implication (a)=^>(b) it is enough to apply Lemma 13.31 
The implication (b)=>(c) is trivial. 

(c)^(a). Observe that 

(5.4) fd{x) >0 for X G R”. 

Indeed, suppose there exists xq G M" \ {0} such that fd{xo) < 0. Let i G R, i > 0, 
be such that ixo G G. Since f^i is the leading form of /, we may assume that 
f{txo) < 0. Let H C G be a compact convex neighbourhood of txo such that 
/(x) < 0 for X G ii. By Theorem 14.11 there exists Nq G N such that for N > Nq 
the polynomial (pN strictly concave on H. This contradicts (c) and gives (15.41) . 

Assume to the contrary that (a) fails. Then by (15.41) . /^^(O) ^ {0}. The leading 
form of (/jtv is equal to ipNix) = (xf+ • • •+x^)^/d(x) and by Lemma \E7I \ this form is 
convex. So 00 , 0]) is a convex set, and by (15.411 . so is /^^(O) = /^^((— 00 ,0]). 

Consequently, the level set /^^(O) is a linear subspace, say of dimension k > 0 (since 
fd is a homogeneous polynomial). By choosing a suitable coordinate system, we 
may assume that '0)^^(O) = /^^(O) = R^ x {0}. Since fd 0, we have k < n. As 
V'Af Ir''+i x{o} k + 1 < n is also a convex function, we may assume that n = fc +1, 
and moreover that n = 2 and k = 1. Then 

/d(xi,X 2 ) = X 2 /(xi,X 2 ) 

for some s G N* and a homogeneous polynomial / such that /(xi,X 2 ) > 0 for 
(xi,X2) G R^ \ {0}, and ^Ar(xi,X2) = X2(xf + X2)^/(xi,X2). Observe that for 
'0Ar(xi,X 2 ) = 1 we have X 2 —>■ 0 as xi —>■ 00 or xi —>■ — 00 . Consequently, the set 
'0)^^((— 00 ,1]) is not convex, which contradicts the convexity of tpN. This gives (a) 
and ends the proof of (c)=)>(a). The proof of Proposition 15.31 is complete. □ 

Theorem 5.5. Let X C R" be a convex closed set. Assume that f is positive on X, 

(5.5) /-i(0) = {0} 
and there exists m G R such that 

(5.6) 0 < m < inf{/(x) : x G X}. 

Then there exists Nq G N such that for any integer N > Nq the polynomial ip^ (x) = 
(1 + xf + • • • + x^)^/(x) is strongly convex on X. 

Proof. If fd{x) < 0 for some x 0, then A is a compact set and the assertion 
follows from Theorem 14.II 

Assume that fd{x) > 0 for any x 0. If A is a bounded set, then the assertion 
immediately follows from Theorem 14.11 So assume that A is unbounded. Since 
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fd{x) > 0 for X 0, by Proposition 15.31 there are R > 0 and Ni G N such that 
for N > Ni the polynomial ipN is strongly (locally) convex in {x G M" : |x| > i?}. 
By Theorem 14.11 one can assume that for N > Ni, the polynomial is strongly 
convex on {x G X : |x| < i? + 1}. Summing up, for N > Ni the polynomial ip^ is 
strongly convex on X. □ 

Remark 5.6. U X = R" then, for any N large enough, ipn{x) is not only strictly 
convex, but it is a sum of squares of polynomials. More precisely, if / satisfies 
the assumptions of Theorem 15.51 with X = R", then its homogenization, denoted 
by p, satisfies the assumption of Reznick’s theorem |241 Theorem 3.12]. So after 
dehomogenization of (xg +xf + • • • + x^)^p(x) we see that our function is a sum 
of even powers of afhne functions. Hence is convex and it is a sum of squares 
of polynomials. However, this method cannot be applied if X is a proper subset 
of R”. 

Corollary 5.7. Let X C R" be a closed convex semialgehraic set containing at least 
two points, let f G R[x], and let d > deg/ be an even integer. Then the following 
conditions are equivalent: 

(1) f is nonnegative on X, 

(2) for any a,b > 0 there exists iVg G N such that for any integer N > iVg 
the polynomial ip^ix) = (1 + |xp)^(/(x) + a|x|'^ + 6) is a strongly convex 
function on X. 

Proof. The polynomial /(x) + alxj"^ + b satisfies the assumptions of Theorem 15.51 
if a, 6 > 0. Hence the implication (1)=^>(2) follows from Theorem 15.51 To prove 
the converse assume that /(xq) < 0 for some xq G IntX. Note that X, being 
convex and containing at least two points, has nonempty (relative) interior. Then 
for sufficiently small a, b and N large enough, the function —pN is strictly convex 
in a neighbourhood of xq. So pn is strongly concave in a neighbourhood of xq, 
which is absurd. □ 

For homogeneous polynomials on R" we obtain the following extension of Rez¬ 
nick’s result mentioned in the Introduction. For a fixed / G R[x] and a positive 
integer N, we set tpNix) := (x^ H-+ x^)^/(x). 

Corollary 5.8. Let f G R[x] be a nonzero homogeneous polynomial. The following 
conditions are equivalent: 

(a) /(x) > 0 for X G R" \ {0}, 

(b) there exists Ni G N such that for any N > Ni the polynomial ^at is a sum 
of even powers of linear functions, 

(c) there exists N 2 G N such that for any N > N 2 the polynomial ip^ is a 
convex function, 

(d) there exists N 3 G N such that for any N > N 3 the polynomial ipi,; is a 
strictly convex function. 

Proof. The implication (a)=>(b) is Reznick’s result (see [241 Theorem 3.12]). The 
implications (b)=^(c) and (d)=>(c) are trivial. The implication (c)=>(a) follows by 
the same argument as (c)=>(a) in Proposition 15.31 

To complete the proof it suffices to prove (a)=>(d). We will investigate the 
convexity of ipN on each line I in R". If 0 G / then clearly iPn\i is convex, so we 
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will check the convexity of on lines I C M" \ {0}. Since / is homogeneous, it 
suffices to consider the convexity oi tpN on lines of the form 

I = {a + bt :t G R}, (a, b) G A, 

where A := {{a,b) G R" x R" : |a| = |6| = 1, {a,b) = 0}. Clearly A is compact. 

Denote g{t, a, b) = f{a + bt) for t G R, (a, b) G A. Then 

g{t, a, b) = go{a, b)t^ + gi{a, H-h gd{a, b), 

and go{a,b) = f{b). So by (a) there exists m > 0 such that gQ{a,b) > m for 

(a, b) G A. Moreover, f{x) > m for x G R", |x| > 1, hence 

g{t, a,b)>m for t G R and (a, b) G A. 

Take R, D G R such that 

gi(a,b) 

go{a, b) 

and 


i? > 1 + 2 max max 

{a,b)GA l<i<d 


d -1 

D> max max |l, y^((i — i)|gi(a, 
ia,b)eA I ^ 

d -2 

Y^{d-i){d-z-l)\g,{a,b)\R<^-^-^}. 

i =0 


Since for (a, b) G A, 

il^Ni.a + bt) = (1 + g{t, a, 6), 

Lemma [331 implies that i/ijv is a strictly convex function provided N > Af{m, R, D). 
This gives the implication (a)=;>(d) and completes the proof. □ 


Remark 5.9. Let 7Vi,iV2,A^3 be the minimal values in Corollary 15.81 Obviously 
N 2 < iVi and N 2 < N^. It is not clear to the authors whether the equalities 
Ni = N 2 = N 3 hold. 

By a result of Blekherman 0, m there exist strictly convex positive forms that 
are not sums of squares. However, this does not answer our question, because we are 
interested in the smallest numbers Ni such that for every N > Ni the polynomials 
ijjN are respectively: sum of even powers of linear functions, convex and strictly 
convex. Note that multiplying a convex form by {xf + ■ ■ ■ + x'^)^ may produce 
a nonconvex form. 

For instance the polynomial f{x, y) = {x — ky)^ + is a strictly convex sum of 
squares of linear forms. However for sufficiently large k we can find TV > 1 such that 
the polynomial i/jat is not convex (cf. Example 13.51) and consequently not a sum of 
even powers of linear functions. 


6. A proximity algorithm for a polynomial ON A CONVEX SET 

Let X C R" be a compact convex semialgebraic set. We consider a polynomial 
/ restricted to X. We propose an algorithm, based on our convexification method, 
which produces a sequence converging to a critical point of / on A. 

Using a translation and a dilatation we may assume that X is contained in a 
ball of radius 1/2. Replacing / by / + c, where c is a constant large enough we 
may assume that m = inf{/(a;) : x G X} = D > 0, where D is a bound for the 
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absolute value of the first and the second directional derivatives of / (along vectors 
of norm 1). Indeed, we may increase D in such a way that |/(a;)| < D for x € X. 
Then we put c = 2D, hence f{x) + c > D tor x G X. Since now m = D and 2R = 1, 
by p.2l) we have N{m, 2R, D) = 6. 

By Remark 14.21 with N = 6 and some ^ > 0 the function 

■.= {l + \x-^\Yf{x) 

is /r-strongly convex on X for any ^ G X. This means that 

( 6 . 1 ) (pN,^{y) > + {y - x,V(pN,^{x)) + ^\y - x\'^ ioix,yGX. 

Recall that any strictly convex, hence in particular any strongly convex, function ip 
on a convex closed set X admits a unique point, denoted by argmin^ Lp, at which 
Lp attains its minimum on X. 

Choose an arbitrary point ao G X, and by induction set 

(6.2) Oj. := argminjf ipN,a„.i- 

Lemma 6.1. For any v G N we have 

\ay+i - aj.| = dist(ay, /“^(/(oy+i)) n X). 

Proof. If \a' — < |ai,+i — ay\ for some a' G /“^(/(oiy+i)) fl X, then by the defi¬ 
nition of PN.ai, we have (^7v,a„(a0 < (oiz+i), which contradicts the definition 

of aj/+i. So, |aj/+i — Oi/I < dist(ai,,/“^(/(a,y+i)) flX). The opposite inequality is 
obvious. □ 


Lemma 6.2. For any p G N we have 


f{au+i) < 


f{a„) - f |Oi.+i - 

(1 -f |ai/+i — 


In particular the sequence f{ai,) is decreasing. 


Proof. Since (pN,a^ is strongly convex, the definition of implies that the func¬ 
tion 

[0,1] 9 t FN,a„{ai, F t(aj.+i - a^)) 

decreases, so {av+i — av,XtpN,a,,[au+i)) < 0. Thus, by (|6.1I1 we see that 

FN.aAO'u) > PN,aAO'V+l) + ^\ai, - Oj^+ip. 

Again, by the definition of PN,a„ we have 

fM > (1 + \a„+i - a^n^/(ai.+i) -f ^\ai,+i - 

This ends the proof of the lemma. □ 

Now we estimate from below the length of — a,y+i|, i.e., of the step in our 
sequence. It is enough to consider only the one-dimensional case with Oi, = 0. By 
a direct computation we obtain: 

Lemma 6.3. Let / : [0, ry] —>■ M 6e a function such that 0 </ < C and f < —tj 

on [ 0 , 77 ] for some C > ^ and 77 > 0. Assume that <pm{x) = {1+x'^)^ f{x) is strictly 

2 

convex on [ 0 , 77 ]. Then hi := argmin^Q^^j pfq > Hence /(O) — /(&i) > 2 at?- 
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Let / be a function in a neighborhood C/ of a closed set X C M". Recall that 
a G X is a lower critical point of f on X if 

(6.3) (V/(a), X — a) > 0 for x € X in a neighbourhood of a. 

We denote by Sjf / the set of lower critical points of / on X, and by S/ := {x € 
U : V/(x) = 0} the set of ordinary critical points of /. The following proposition 
recalls all the necessary properties of these sets. 

Proposition 6.4. Assume that X C M" is closed and f : R" —?► R is a function. 
Then: 

(1) xns/c Sx/; 

(2) if f restricted to X has a local minimum at a, then a G Sx/; 

(3) if M G X is a smooth manifold and a € M n Sx/, then for any z € TaM, 

(V/(a),2) =0; 

(4) if f is a polynomial and X is semialgehraic, then T,xf is a semialgehraic 
set and /(Sx/) is a finite set. 

Proof. The first three statements follow immediately from the definition. If / is 
a polynomial and X is semialgehraic then the set Sx/ is described by a first 
order formula (in the language of ordered fields) so it is semialgehraic as well (see 
e.g. [71 Chapter 2]). Hence Sx/ has finitely many connected components (in fact 
connected by piecewise semialgehraic arcs). Each such component is a finite 
union of smooth manifolds, hence by condition (3) the function / is constant on it. 
So /(Ex/) is a finite set. □ 

Theorem 6.5. Let X C R" be a compact convex semialgehraic set and f : R" —>■ R 
a positive polynomial on X. Let ai, be the sequence defined by (lOl) with oq € X. 
Then the limit 

a* = lim Gi, 

lA—>00 

exists, and a* € Ex/. 

Remark 6.6. Note that Lemmas 16. H and l6.2l hold true for any function of class 
in a neighborhood of X. However, they are not sufficient to prove the convergence 
of the sequence a,, (at least we have not been able to do this). For / polynomial the 
convergence of the sequence will follow from some fine properties of the gradient 
trajectories of polynomials. 

Proof of Theorem \6.5[ First, assuming that a* = limy_>oo exists, we shall prove 

that a* € Ex/- Suppose that a* ^ Ex/, so there exists x € X with (Vf(a*),x — 

a*} < 0. Then there exists rj > 0 such that (V/(a* + t{x — a*)),x — a*) < —rj for 

t € [ 0 , 17 ]. By continuity the same holds with a* replaced by for sufficiently 

2 

close to a*. Moreover, we may assume that |/(ay) — /(a*)| < 2 xt?> '"^^ere C > f{x) 
for X G X. Hence by continuity and Lemma 16.31 we obtain /(a^+i) < /(a*), which 
is a contradiction. 

Recall now the Comparison Principle [1] Lemma 4.2]. Let / : R” —R be a 
polynomial and let M C R" be a smooth bounded semialgebraic set. Let V/(x) 
denote the gradient of / with respect to the standard Euclidean scalar product, 
and VMf{x) its projection on TxM, the tangent space to M at x. 

Let Fm C M be a semialgebraic curve meeting each level set of / and such that 
for every point y e Fm we have |VM/(y)| < |VM/(a:)| for all x £ f~^{f{y)) n M. 
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By standard arguments (semialgebraic choice) such a curve always exists; it is called 
a talweg or a ridge-valley line of f in X. Then the following lemma holds. 

Lemma 6.7 (Comparison Principle). For every pair of values a < b taken by f, the 
length of any trajectory ofS/Mf lyin-g /~^((®i^)) H M is hounded by the length 
o/Pm n f~^{{a,b)). 

To prove that lim^-^oo exists recall first that by Lemma 16.21 we have 
/(oi/) > /(aj.+i) >•••>/*:= lim f{a„). 

V'—^OO 

By Proposition 16.41 41 the set f{T,xf) of critical values of / on X is finite, so we 
may assume that either the sequence f{a^) is eventually constant, or (/(oj/), /») fl 
/(Sx/) = 0 for p large enough. Clearly in the hrst case, by Lemma [6.21 also the 
sequence Ui, is eventually constant. So we assume from now on that the sequence 
/(ttj.) is strictly decreasing and (/(ao), /*) C /(Ex/) = 0- 

The set X is semialgebraic, so there exists a stratification X = Mi, i.e., a 
finite disjoint union of connected smooth semialgebraic sets, called strata. Moreover 
Mi \ Mi is a union of some of the Mj’s of dimension smaller than dim Mi (cf. [31 
Chapter 2]). We can refine this stratification in such a way that / is of constant 
rank on each Mi , i G I; then our polynomial / restricted to Mi is either a constant 
or a submersion. Let I* = {i £ I ■. rank/|Mi = 1}; note that Cxf = Uzg/\/* fi^i) 
is a finite set. Since the sequence /(«[/) is strictly decreasing we may assume that 
(/(ao),/*)nC'x/ = 0. 

To each Mi, z G /*, we can associate a semialgebraic curve Pi := Pm; which is a 
talweg of / in Mi. Set P := UiG/* 

Recall that, by Lemma lbTl a,j+i is the point closest to Oi, on the fiber f~^{f{ai, +i)) 
r\X. To estimate —Oi/I we will construct a continuous curve ji, : [t^,, > X 

such that and /( 7 i/(ty+i)) = /(aj/+i). By Lemma [37T] we will then have 

\a^+i — Uj/I < length( 7 i^). The curve will be a piecewise trajectory of —XMif 
(more precisely, of — Vm^Z/IVm;/!)- Hence, by the Comparison Principle, 

|a;.+i -a^\ < length(7i,) < length(r n/"^((/(oi^+i),/(a^,)))). 

Recall that P, being a bounded semialgebraic curve, has finite length (see e.g. |32l 
Corollary 5.2]); therefore 

OO 

^|a,.+i -a^\ < length(rn/~^((/*,/(ao)))) < oo. 

So the series ~ is convergent, which implies that a* = lim,y_>.oo Oi/ 

exists. 

Construction of the curve Assume that belongs to a stratum Mi for some 
i G I*. Let ji, : [tjy,^^) —>■ Mi be a trajectory of Vi := mJ/\XM if\- By a 

trajectory we mean a maximal solution (to the right) of 7 ' = Vi in Mi. Note that 

6 * = lim ^v{s) G Mi \ Mi 
sy'tl 

exists. Indeed, by Lemma 16.71 any maximal trajectory of Vi has finite length so it 
has a limit in XIi. But the vector field Vi does not vanish on M^, hence this limit 
belongs to Mi \ Mi, which is a union of strata of smaller dimension. 

If f{bi) ^ f(.o,v+i) then there exists t^+i G [tu,tl] such that f{j„{t„+i)) = 
/(oiz+i), so 7 i/ restricted to [tv,tv-i-i] is the curve we are looking for. Now if /( 6 *) > 
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/(ai,+i), then h\ G Mi^ for some ii G I* such that dimMij < dimMi. We repeat 
the above construction on starting from the point 5*, then we glue it with 
the previous one. In this way the dimension of the stratum in which our curve 
stays is strictly decreasing, but this dimension is always at least 1. Finally we will 
reach the level f~^{ay+i). Indeed, when our curve arrives at a point in a stratum 
of dimension 1 we follow this stratum until we arrive at the level f~^{ay+i) since 
(/(ao), /*) n Cxf = 0. The estimate 

length( 7 ,.) < length(r n f~^{f{ay+i), f{ay))) 
follows from Comparison Principle. □ 

Remark 6.8, In the case when X is a closed ball (or more generally when X has a 
smooth boundary) the length of the curve P can be effectively estimated (see |T]). 
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